Dataset statistics
| Number of variables | 7 |
|---|---|
| Number of observations | 122265 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.5 MiB |
| Average record size in memory | 56.0 B |
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 3 |
| DateTime | 1 |
row_id has a high cardinality: 122265 distinct values | High cardinality |
county has a high cardinality: 1871 distinct values | High cardinality |
state has a high cardinality: 51 distinct values | High cardinality |
cfips is highly overall correlated with state | High correlation |
microbusiness_density is highly overall correlated with active | High correlation |
active is highly overall correlated with microbusiness_density | High correlation |
state is highly overall correlated with cfips | High correlation |
row_id is uniformly distributed | Uniform |
row_id has unique values | Unique |
Reproduction
| Analysis started | 2023-01-08 06:05:05.504744 |
|---|---|
| Analysis finished | 2023-01-08 06:08:21.430547 |
| Duration | 3 minutes and 15.93 seconds |
| Software version | pandas-profiling vv3.6.2 |
| Download configuration | config.json |
row_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 122265 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 955.3 KiB |
| 1001_2019-08-01 | 1 |
|---|---|
| 39099_2022-07-01 | 1 |
| 39101_2020-04-01 | 1 |
| 39101_2020-03-01 | 1 |
| 39101_2020-02-01 | 1 |
| Other values (122260) |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.899841 |
| Min length | 15 |
Characters and Unicode
| Total characters | 1943994 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 122265 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1001_2019-08-01 |
|---|---|
| 2nd row | 1001_2019-09-01 |
| 3rd row | 1001_2019-10-01 |
| 4th row | 1001_2019-11-01 |
| 5th row | 1001_2019-12-01 |
Common Values
| Value | Count | Frequency (%) |
| 1001_2019-08-01 | 1 | < 0.1% |
| 39099_2022-07-01 | 1 | < 0.1% |
| 39101_2020-04-01 | 1 | < 0.1% |
| 39101_2020-03-01 | 1 | < 0.1% |
| 39101_2020-02-01 | 1 | < 0.1% |
| 39101_2020-01-01 | 1 | < 0.1% |
| 39101_2019-12-01 | 1 | < 0.1% |
| 39101_2019-11-01 | 1 | < 0.1% |
| 39101_2019-10-01 | 1 | < 0.1% |
| 39101_2019-09-01 | 1 | < 0.1% |
| Other values (122255) | 122255 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 1001_2019-08-01 | 1 | < 0.1% |
| 1001_2020-12-01 | 1 | < 0.1% |
| 1001_2019-12-01 | 1 | < 0.1% |
| 1001_2020-01-01 | 1 | < 0.1% |
| 1001_2020-02-01 | 1 | < 0.1% |
| 1001_2020-03-01 | 1 | < 0.1% |
| 1001_2020-04-01 | 1 | < 0.1% |
| 1001_2020-05-01 | 1 | < 0.1% |
| 1001_2020-06-01 | 1 | < 0.1% |
| 1001_2020-07-01 | 1 | < 0.1% |
| Other values (122255) | 122255 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 488994 | |
| 1 | 342018 | |
| 2 | 336657 | |
| - | 244530 | |
| _ | 122265 | 6.3% |
| 3 | 78552 | 4.0% |
| 9 | 73143 | 3.8% |
| 5 | 67827 | 3.5% |
| 7 | 58506 | 3.0% |
| 4 | 54177 | 2.8% |
| Other values (2) | 77325 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1577199 | |
| Dash Punctuation | 244530 | 12.6% |
| Connector Punctuation | 122265 | 6.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 488994 | |
| 1 | 342018 | |
| 2 | 336657 | |
| 3 | 78552 | 5.0% |
| 9 | 73143 | 4.6% |
| 5 | 67827 | 4.3% |
| 7 | 58506 | 3.7% |
| 4 | 54177 | 3.4% |
| 8 | 43662 | 2.8% |
| 6 | 33663 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 244530 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 122265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1943994 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 488994 | |
| 1 | 342018 | |
| 2 | 336657 | |
| - | 244530 | |
| _ | 122265 | 6.3% |
| 3 | 78552 | 4.0% |
| 9 | 73143 | 3.8% |
| 5 | 67827 | 3.5% |
| 7 | 58506 | 3.0% |
| 4 | 54177 | 2.8% |
| Other values (2) | 77325 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1943994 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 488994 | |
| 1 | 342018 | |
| 2 | 336657 | |
| - | 244530 | |
| _ | 122265 | 6.3% |
| 3 | 78552 | 4.0% |
| 9 | 73143 | 3.8% |
| 5 | 67827 | 3.5% |
| 7 | 58506 | 3.0% |
| 4 | 54177 | 2.8% |
| Other values (2) | 77325 | 4.0% |
cfips
Real number (ℝ)
| Distinct | 3135 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30376.038 |
| Minimum | 1001 |
|---|---|
| Maximum | 56045 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 955.3 KiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 5095 |
| Q1 | 18177 |
| median | 29173 |
| Q3 | 45077 |
| 95-th percentile | 53065 |
| Maximum | 56045 |
| Range | 55044 |
| Interquartile range (IQR) | 26900 |
Descriptive statistics
| Standard deviation | 15143.509 |
|---|---|
| Coefficient of variation (CV) | 0.4985347 |
| Kurtosis | -1.0974534 |
| Mean | 30376.038 |
| Median Absolute Deviation (MAD) | 12012 |
| Skewness | -0.077451731 |
| Sum | 3.7139262 × 109 |
| Variance | 2.2932586 × 108 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1001 | 39 | < 0.1% |
| 39133 | 39 | < 0.1% |
| 39089 | 39 | < 0.1% |
| 39091 | 39 | < 0.1% |
| 39093 | 39 | < 0.1% |
| 39095 | 39 | < 0.1% |
| 39097 | 39 | < 0.1% |
| 39099 | 39 | < 0.1% |
| 39101 | 39 | < 0.1% |
| 39103 | 39 | < 0.1% |
| Other values (3125) | 121875 |
| Value | Count | Frequency (%) |
| 1001 | 39 | |
| 1003 | 39 | |
| 1005 | 39 | |
| 1007 | 39 | |
| 1009 | 39 | |
| 1011 | 39 | |
| 1013 | 39 | |
| 1015 | 39 | |
| 1017 | 39 | |
| 1019 | 39 |
| Value | Count | Frequency (%) |
| 56045 | 39 | |
| 56043 | 39 | |
| 56041 | 39 | |
| 56039 | 39 | |
| 56037 | 39 | |
| 56035 | 39 | |
| 56033 | 39 | |
| 56031 | 39 | |
| 56029 | 39 | |
| 56027 | 39 |
county
Categorical
| Distinct | 1871 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 955.3 KiB |
| Washington County | 1170 |
|---|---|
| Jefferson County | 975 |
| Franklin County | 936 |
| Lincoln County | 897 |
| Jackson County | 897 |
| Other values (1866) |
Length
| Max length | 33 |
|---|---|
| Median length | 28 |
| Mean length | 14.016906 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1713777 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Autauga County |
|---|---|
| 2nd row | Autauga County |
| 3rd row | Autauga County |
| 4th row | Autauga County |
| 5th row | Autauga County |
Common Values
| Value | Count | Frequency (%) |
| Washington County | 1170 | 1.0% |
| Jefferson County | 975 | 0.8% |
| Franklin County | 936 | 0.8% |
| Lincoln County | 897 | 0.7% |
| Jackson County | 897 | 0.7% |
| Madison County | 741 | 0.6% |
| Montgomery County | 702 | 0.6% |
| Clay County | 702 | 0.6% |
| Union County | 663 | 0.5% |
| Monroe County | 663 | 0.5% |
| Other values (1861) | 113919 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| county | 117195 | |
| parish | 2496 | 1.0% |
| city | 1716 | 0.7% |
| washington | 1209 | 0.5% |
| jefferson | 1092 | 0.4% |
| st | 1014 | 0.4% |
| franklin | 1014 | 0.4% |
| lincoln | 936 | 0.4% |
| jackson | 936 | 0.4% |
| madison | 780 | 0.3% |
| Other values (1857) | 125073 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 189813 | |
| o | 184587 | |
| t | 157482 | 9.2% |
| u | 139581 | 8.1% |
| C | 133224 | 7.8% |
| y | 132171 | 7.7% |
| 131196 | 7.7% | |
| a | 87360 | 5.1% |
| e | 84162 | 4.9% |
| r | 62478 | 3.6% |
| Other values (47) | 411723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1327638 | |
| Uppercase Letter | 253500 | 14.8% |
| Space Separator | 131196 | 7.7% |
| Other Punctuation | 1209 | 0.1% |
| Dash Punctuation | 195 | < 0.1% |
| Math Symbol | 39 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 189813 | |
| o | 184587 | |
| t | 157482 | |
| u | 139581 | |
| y | 132171 | |
| a | 87360 | |
| e | 84162 | |
| r | 62478 | 4.7% |
| i | 49179 | 3.7% |
| l | 49179 | 3.7% |
| Other values (16) | 191646 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 133224 | |
| M | 12090 | 4.8% |
| S | 11037 | 4.4% |
| P | 10218 | 4.0% |
| B | 10140 | 4.0% |
| W | 8775 | 3.5% |
| L | 8697 | 3.4% |
| H | 7722 | 3.0% |
| G | 6084 | 2.4% |
| D | 5694 | 2.2% |
| Other values (16) | 39819 | 15.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1053 | |
| ' | 156 | 12.9% |
Space Separator
| Value | Count | Frequency (%) |
| 131196 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 195 |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1581138 | |
| Common | 132639 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 189813 | |
| o | 184587 | |
| t | 157482 | |
| u | 139581 | 8.8% |
| C | 133224 | 8.4% |
| y | 132171 | 8.4% |
| a | 87360 | 5.5% |
| e | 84162 | 5.3% |
| r | 62478 | 4.0% |
| i | 49179 | 3.1% |
| Other values (42) | 361101 |
Common
| Value | Count | Frequency (%) |
| 131196 | ||
| . | 1053 | 0.8% |
| - | 195 | 0.1% |
| ' | 156 | 0.1% |
| ± | 39 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1713699 | |
| Latin 1 Sup | 78 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 189813 | |
| o | 184587 | |
| t | 157482 | 9.2% |
| u | 139581 | 8.1% |
| C | 133224 | 7.8% |
| y | 132171 | 7.7% |
| 131196 | 7.7% | |
| a | 87360 | 5.1% |
| e | 84162 | 4.9% |
| r | 62478 | 3.6% |
| Other values (45) | 411645 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Ã | 39 | |
| ± | 39 |
state
Categorical
HIGH CARDINALITY  HIGH CORRELATION 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 955.3 KiB |
| Texas | |
|---|---|
| Georgia | 6201 |
| Virginia | 5070 |
| Kentucky | 4680 |
| Missouri | 4485 |
| Other values (46) |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 8.0810207 |
| Min length | 4 |
Characters and Unicode
| Total characters | 988026 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alabama |
|---|---|
| 2nd row | Alabama |
| 3rd row | Alabama |
| 4th row | Alabama |
| 5th row | Alabama |
Common Values
| Value | Count | Frequency (%) |
| Texas | 9906 | 8.1% |
| Georgia | 6201 | 5.1% |
| Virginia | 5070 | 4.1% |
| Kentucky | 4680 | 3.8% |
| Missouri | 4485 | 3.7% |
| Kansas | 4095 | 3.3% |
| Illinois | 3978 | 3.3% |
| North Carolina | 3900 | 3.2% |
| Iowa | 3861 | 3.2% |
| Tennessee | 3705 | 3.0% |
| Other values (41) | 72384 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| texas | 9906 | 7.1% |
| virginia | 7215 | 5.2% |
| georgia | 6201 | 4.4% |
| north | 5967 | 4.3% |
| carolina | 5694 | 4.1% |
| new | 4914 | 3.5% |
| kentucky | 4680 | 3.3% |
| dakota | 4602 | 3.3% |
| missouri | 4485 | 3.2% |
| south | 4329 | 3.1% |
| Other values (45) | 81900 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 134082 | |
| i | 106626 | 10.8% |
| n | 84630 | 8.6% |
| s | 83148 | 8.4% |
| o | 79755 | 8.1% |
| e | 60099 | 6.1% |
| r | 50700 | 5.1% |
| t | 32292 | 3.3% |
| l | 31590 | 3.2% |
| h | 25467 | 2.6% |
| Other values (36) | 299637 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 830544 | |
| Uppercase Letter | 139854 | 14.2% |
| Space Separator | 17628 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 134082 | |
| i | 106626 | |
| n | 84630 | |
| s | 83148 | |
| o | 79755 | |
| e | 60099 | |
| r | 50700 | 6.1% |
| t | 32292 | 3.9% |
| l | 31590 | 3.8% |
| h | 25467 | 3.1% |
| Other values (14) | 142155 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 19890 | |
| N | 15132 | |
| T | 13611 | |
| I | 13338 | |
| C | 10803 | 7.7% |
| K | 8775 | 6.3% |
| O | 7839 | 5.6% |
| V | 7761 | 5.5% |
| W | 7371 | 5.3% |
| A | 7176 | 5.1% |
| Other values (11) | 28158 |
Space Separator
| Value | Count | Frequency (%) |
| 17628 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 970398 | |
| Common | 17628 | 1.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 134082 | |
| i | 106626 | 11.0% |
| n | 84630 | 8.7% |
| s | 83148 | 8.6% |
| o | 79755 | 8.2% |
| e | 60099 | 6.2% |
| r | 50700 | 5.2% |
| t | 32292 | 3.3% |
| l | 31590 | 3.3% |
| h | 25467 | 2.6% |
| Other values (35) | 282009 |
Common
| Value | Count | Frequency (%) |
| 17628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 988026 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 134082 | |
| i | 106626 | 10.8% |
| n | 84630 | 8.6% |
| s | 83148 | 8.4% |
| o | 79755 | 8.1% |
| e | 60099 | 6.1% |
| r | 50700 | 5.1% |
| t | 32292 | 3.3% |
| l | 31590 | 3.2% |
| h | 25467 | 2.6% |
| Other values (36) | 299637 |
| Distinct | 39 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 955.3 KiB |
| Minimum | 2019-01-08 00:00:00 |
|---|---|
| Maximum | 2022-01-10 00:00:00 |
Histogram with fixed size bins (bins=39)
microbusiness_density
Real number (ℝ)
| Distinct | 97122 |
|---|---|
| Distinct (%) | 79.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8176706 |
| Minimum | 0 |
|---|---|
| Maximum | 284.34003 |
| Zeros | 26 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 955.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.81756973 |
| Q1 | 1.6393442 |
| median | 2.5865433 |
| Q3 | 4.5192308 |
| 95-th percentile | 10.55302 |
| Maximum | 284.34003 |
| Range | 284.34003 |
| Interquartile range (IQR) | 2.8798866 |
Descriptive statistics
| Standard deviation | 4.9910868 |
|---|---|
| Coefficient of variation (CV) | 1.3073645 |
| Kurtosis | 556.68529 |
| Mean | 3.8176706 |
| Median Absolute Deviation (MAD) | 1.1668079 |
| Skewness | 15.970181 |
| Sum | 466767.49 |
| Variance | 24.910947 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 26 | < 0.1% |
| 1.8518518 | 20 | < 0.1% |
| 2.1276596 | 19 | < 0.1% |
| 1.6393442 | 19 | < 0.1% |
| 2.9940119 | 18 | < 0.1% |
| 0.97719872 | 18 | < 0.1% |
| 1.25 | 18 | < 0.1% |
| 0.93457943 | 17 | < 0.1% |
| 1.4925373 | 17 | < 0.1% |
| 1.369863 | 16 | < 0.1% |
| Other values (97112) | 122077 |
| Value | Count | Frequency (%) |
| 0 | 26 | |
| 0.063836582 | 12 | |
| 0.064516127 | 12 | |
| 0.066711143 | 1 | < 0.1% |
| 0.069662139 | 5 | < 0.1% |
| 0.08605852 | 1 | < 0.1% |
| 0.088652484 | 7 | < 0.1% |
| 0.10006671 | 9 | < 0.1% |
| 0.14338386 | 4 | < 0.1% |
| 0.15362556 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 284.34003 | 1 | |
| 277.53598 | 1 | |
| 227.75665 | 1 | |
| 224.53825 | 1 | |
| 217.58711 | 1 | |
| 217.25502 | 1 | |
| 217.1413 | 1 | |
| 210.0473 | 1 | |
| 208.22719 | 1 | |
| 206.80765 | 1 |
active
Real number (ℝ)
| Distinct | 19193 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6442.8582 |
| Minimum | 0 |
|---|---|
| Maximum | 1167744 |
| Zeros | 26 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 955.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 145 |
| median | 488 |
| Q3 | 2124 |
| 95-th percentile | 26306.2 |
| Maximum | 1167744 |
| Range | 1167744 |
| Interquartile range (IQR) | 1979 |
Descriptive statistics
| Standard deviation | 33040.012 |
|---|---|
| Coefficient of variation (CV) | 5.1281607 |
| Kurtosis | 471.82158 |
| Mean | 6442.8582 |
| Median Absolute Deviation (MAD) | 419 |
| Skewness | 17.572118 |
| Sum | 7.8773606 × 108 |
| Variance | 1.0916424 × 109 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 33 | 327 | 0.3% |
| 36 | 319 | 0.3% |
| 69 | 306 | 0.3% |
| 32 | 305 | 0.2% |
| 39 | 296 | 0.2% |
| 63 | 290 | 0.2% |
| 34 | 290 | 0.2% |
| 37 | 289 | 0.2% |
| 68 | 277 | 0.2% |
| 76 | 276 | 0.2% |
| Other values (19183) | 119290 |
| Value | Count | Frequency (%) |
| 0 | 26 | < 0.1% |
| 1 | 76 | |
| 2 | 120 | |
| 3 | 94 | |
| 4 | 62 | 0.1% |
| 5 | 164 | |
| 6 | 178 | |
| 7 | 119 | |
| 8 | 130 | |
| 9 | 121 |
| Value | Count | Frequency (%) |
| 1167744 | 1 | |
| 1160868 | 1 | |
| 1153292 | 1 | |
| 1152842 | 1 | |
| 1151836 | 1 | |
| 1150017 | 1 | |
| 1143527 | 1 | |
| 1142598 | 1 | |
| 1142034 | 1 | |
| 1141159 | 1 |
| cfips | microbusiness_density | active | state | |
|---|---|---|---|---|
| cfips | 1.000 | 0.127 | 0.073 | 0.987 |
| microbusiness_density | 0.127 | 1.000 | 0.783 | 0.086 |
| active | 0.073 | 0.783 | 1.000 | 0.146 |
| state | 0.987 | 0.086 | 0.146 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| row_id | cfips | county | state | first_day_of_month | microbusiness_density | active | |
|---|---|---|---|---|---|---|---|
| 0 | 1001_2019-08-01 | 1001 | Autauga County | Alabama | 2019-01-08 | 3.007682 | 1249 |
| 1 | 1001_2019-09-01 | 1001 | Autauga County | Alabama | 2019-01-09 | 2.884870 | 1198 |
| 2 | 1001_2019-10-01 | 1001 | Autauga County | Alabama | 2019-01-10 | 3.055843 | 1269 |
| 3 | 1001_2019-11-01 | 1001 | Autauga County | Alabama | 2019-01-11 | 2.993233 | 1243 |
| 4 | 1001_2019-12-01 | 1001 | Autauga County | Alabama | 2019-01-12 | 2.993233 | 1243 |
| 5 | 1001_2020-01-01 | 1001 | Autauga County | Alabama | 2020-01-01 | 2.969090 | 1242 |
| 6 | 1001_2020-02-01 | 1001 | Autauga County | Alabama | 2020-01-02 | 2.909326 | 1217 |
| 7 | 1001_2020-03-01 | 1001 | Autauga County | Alabama | 2020-01-03 | 2.933231 | 1227 |
| 8 | 1001_2020-04-01 | 1001 | Autauga County | Alabama | 2020-01-04 | 3.000167 | 1255 |
| 9 | 1001_2020-05-01 | 1001 | Autauga County | Alabama | 2020-01-05 | 3.004948 | 1257 |
| row_id | cfips | county | state | first_day_of_month | microbusiness_density | active | |
|---|---|---|---|---|---|---|---|
| 122255 | 56045_2022-01-01 | 56045 | Weston County | Wyoming | 2022-01-01 | 1.749688 | 98 |
| 122256 | 56045_2022-02-01 | 56045 | Weston County | Wyoming | 2022-01-02 | 1.749688 | 98 |
| 122257 | 56045_2022-03-01 | 56045 | Weston County | Wyoming | 2022-01-03 | 1.767542 | 99 |
| 122258 | 56045_2022-04-01 | 56045 | Weston County | Wyoming | 2022-01-04 | 1.767542 | 99 |
| 122259 | 56045_2022-05-01 | 56045 | Weston County | Wyoming | 2022-01-05 | 1.803249 | 101 |
| 122260 | 56045_2022-06-01 | 56045 | Weston County | Wyoming | 2022-01-06 | 1.803249 | 101 |
| 122261 | 56045_2022-07-01 | 56045 | Weston County | Wyoming | 2022-01-07 | 1.803249 | 101 |
| 122262 | 56045_2022-08-01 | 56045 | Weston County | Wyoming | 2022-01-08 | 1.785395 | 100 |
| 122263 | 56045_2022-09-01 | 56045 | Weston County | Wyoming | 2022-01-09 | 1.785395 | 100 |
| 122264 | 56045_2022-10-01 | 56045 | Weston County | Wyoming | 2022-01-10 | 1.785395 | 100 |